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Abstract. We prove a quantitative version of the Polynomial Szemeredi Theorem for difference sets. This 
result is achieved by first establishing a higher dimensional analogue of a theorem of Sarkozy (the simplest 
non-trivial case of the Polynomial Szemeredi Theorem asserting that the difference set of any subset of the 
integers of positive upper density necessarily contains a perfect square) and then applying a simple lifting 
argument. 



An earlier version of this article, in which the main results were restricted to the case of linearly independent 
polynomials, has already appeared in [8]. The main substantive changes in this revision occur in the statements of 
Theorem\l\ and Corollary\^ which are obtained through a simple modification of the original argument in Section \2.1\ 

1. Introduction 

1.1. Background. A striking and elegant result in density Ramsey theory states that in any subset of the 
integers of positive upper density there necessarily exist two distinct elements whose difference is a perfect 
square. 

This result was originally conjectured by L. Lovasz and eventually verified independently by Furstenberg 
[3], using techniques from ergodic theory, and Sarkozy [13] . using an approach similar in spirit to Roth's 
Fourier analytic (circle method inspired) proof of Szemeredi's theorem for arithmetic progressions of length 
three. 

Sarkozy actually obtained the following stronger quantitative result. 

Theorem A (Sarkozy 13J). If A C [1,N] and (P ^ A — A for any d ^ 0, then there exists an absolute 
constant C > such that 

\A\ ^^^(loglogiV)2xi/3 



Notation: In the theorem above and in the sequel we will use N (and later also M) to denote an arbitrary 
positive integer, [1, A''] to denote {1, . . . , N} as is customary, and A± A to denote the usual difference and 
sum sets of A, namely A zL A = {a ± a' \ a, a' G A} . 

The current best known quantitative bound of (log A^)^'^'°s'°siogiogAf Theorem A is due to Pintz, 
Steiger and Szemeredi [10]. These methods were later extended by Balog, Pelikan, Pintz and Szemeredi 
[1] to obtain the same bounds, with the implicit constant now depending on fc, for sets with no fcth power 
differences. 

We note that it is conjectured that for any e > and N > No{e) sufficiently large there exists a set 
A C [1,N] with 1^1 > N^~'^ that contains no square differences, see for example jB]. Ruzsa [TTl has 
demonstrated that this is at least true for e = 0.267. 

Bergelson and Leibman (extending on the ideas of Furstenberg) established a far reaching qualitative 
generalization of Sarkozy's theorem, the so-called Polynomial Szemeredi Theorem. 

Theorem B (Bergelson and Leibman [2]). If A is a subset of the integers of positive upper density and 
Pi{d), . . . ,Pi{d) are polynomials in Z[d] with Pi{0) = for i = 1, . . . , £, then there exists a G A and d ^ 
such that 

a+{Piid),...,Piid)} C A. 
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We note that no quantitative version of this multiple recurrence result is known beyond the linear case of 
Szemeredi's theorem, see Gowers [Ij, and the general single recurrence case of Sarkozy's theorem (the case 
£ = 2 above). 

The purpose of this paper is to establish a quantitative result on the existence of polynomial configurations 
of the type in Theorem B in the difference set of sparse subsets of the integers. 

1.2. Statement of Main Results. We now fix a family of polynomials 

Pi(d),...,P,(d) eZ[d] 

with Pi{0) = for i = 1, . . . ,£ and set k = max; deg Pi. We will assume throughout this paper that fc > 2. 
Theorem 1. If A C [1, N] and {Pi{d), . .. ,Pe{d)} ^ A — A for any d^Q, then we necessarily have 

1^1 ^ / (loglogiV)^ ^^/^'^-^^ 
N - \ logN 

for some absolute constant C — C(Pi, . . . , P^). 

In the case of a single polynomial {£ — 1), this result has also recently been obtained by Lucier [7] and, to 
the best of our knowledge, constitutes the best bounds that are currently known for arbitrary polynomials 
with integer coefficients and zero constant term. 

We remark that one can immediately deduce, from Theorem [1] the following result on the existence of 
polynomial configurations in (a shift of) the sumset A + B oi two given sets A, B C [1, N]. 

Corollary 2. If A, B C [l,N] and m + {Pi(d), . . . , Peid)} A + B for any d ^ and m G [2,27V], then we 
necessarily have 

- \ \ogN 
for some absolute constant C = C'{Pi, . . . , Pe). 



Proof. Since 



^Bn{m-A)\ = \A\\B\ 



2<m<2N 



it follows that there exists m G [2, 2iV] such that if we set D — B n {m ~ A), then 

l^ll^l 
' - 2iV - 1 

The result then follows from Theorem [T] since 

D - D C A + B -m. □ 

The strategy we will employ to prove Theorem [1] is to lift the problem to Z'^ in such a way that we may 
then apply the following higher dimensional analogue of Sarkozy's theorem. 

Theorem 3. If B C [1, N]'' and {d,cP , . . . ,d^) i B - B for any d then we necessarily have 



N'' - \ logN 
for some absolute constant C — C{k). 

Since Theorem[3]is concerned with the intersection of a difference set with the monomial curve {d,d^ , . . . , d^) 
we speculate that the methodology of Balog et al. pP may be applied in this higher dimensional situation 
to obtain far superior bounds in Theorem [3] and hence also in Theorem [Ij 

Further notational convention: Throughout this paper the letters c and C will denote absolute constants 
that will generally satisfy < c <C 1 <C C, whose values may change from line to line and even from step to 
step, and will unless otherwise specified depend only on the dimension k. 
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2. Reduction to the key dichotomy proposition 
We first present the lifting argument that aUows us to deduce Theorem [T] from Theorem [3l 

2.1. Proof that Theorem [3] implies Theorem [TJ Let P,{d) = c.idH \- c^kd'' for 1 < i < 

Suppose that the coefficient matrix V = {c,;j} has rank; r with 1 < r < £. Without loss in generahty 
we will make the additional assumption that it is in fact the first r polynomials Pi , . . . , that are linearly 
independent and use TZ to denote the r x k matrix corresponding to the first r rows of V. 

As a consequence of this assumption it follows that the remaining polynomials, Pr+i with I < i < £ — r, 
can be expressed as 

Pr+i = diiPi + ■ • • + dirPr 

where V = {dij} is some {£ ~ r) x r matrix with rational coefficients. 
Note that 

P : Z'' ^ 

7^ : z*-' ^ r' 

and 

\v{n{h))) ■ 

Let = ^ X ^ X • • • X A C [1, iV]^ and (5 = \A\/N. 

The full rank assumption on the matrix TZ ensures that there exists an absolute constant c, depending 
only on the coefficients of the matrix TZ, such that 

\n{z^) n (A'' - s)| > cS^'N'' 

for some s G [1, c~^Y . Thus, if we choose N' to be a large enough multiple of N (again depending only the 
coefficients of the matrix TV) and let 

B' ={be [-N', N'f : TZ{b) e A"- - s} , 

it follows that 

\B'\ > cy'N''. 

Since 

tez*-'- fcG-B' 

it follows that there exists c = c(P) and t G Z^^'' such that 

|{6eP' : V{n{b)) eA^-"- ^t} \ >cS^-'-\B'\. 

Hence, if we let 

B={be [~N', Nf : Vib) £ A^ - m] , 
where m= {s,t) it follows that 

\B\ > c5^N^. 

The result now follows from Theorem [3] since if there were to exist a d 7^ such that 

eB-B 

this would immediately implies that 

(Pl(d),...,P,(d))eA^-A^ 

since P(P) C A'^ - m. □ 
Matters therefore reduce to proving Theorem [S] 
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2.2. Dichotomy between randomness and arithmetic structure. Our approach will be to establish 
a dichotomy between randomness and structure of the following form. 

Let us fix the notation Qm = [1, M] x ■ ■ ■ x [1, M*^] and e — (lOfc)"^. 

Proposition 4. Let B C Qm , ^ ^ \B\/\Qm\, and a = Ck5^^'^ / log 5'^ . If M > , with C > sufficiently 
large ( depending only on k ), then either B behaves as though it were a random set in the sense that 

M 

(1) ^|Bn(B+(d,d^...,d'=))| > '-S\B\M 

d=l 

or B has arithmetic structure in the sense that there exists a grid A C Qa/ of the form 

(2) A = {m + (iiq, . . . , 4g'=) I (4, . . . , 4) e Ql} 
with L > such that 

|BnA| > 5{l + n)\K\. 

In contrast with the standard L°° increment strategy of Roth, we will obtain the dichotomy in Proposition 
2] by exploiting the concentration of the mass of the Fourier transform. Similar arguments of this type can 
be found in Heath-Brown [6J and Szemeredi [15], see also Ruzsa and Sanders 12]. The proof of Proposition 
IHwill be presented in Sections [3] and H) 

2.3. Proof that Proposition |4] impUes Theorem [3l It is easy to see, by partitioning [1,^^]*^ into boxes 
of size M X x ■ ■ ■ x with M essentially equal to N^^^ , that we may, with no loss in generality, assume 
that B C Qm with 5 = \B\/\Qm\ > \B\/N''. 

li {d,d^, . . . ,d'^) ^ B — B ior any d ^ (as is the assumption in Theorem [3]), then Proposition fallows us 
to perform an iteration. At the nth step of this iteration we will have a set _B„ C Qm„ of size Sn\QM„\^ this 
set will be an appropriately rescaled version of a subset of B itself and hence will also contain no non-trivial 
differences of the form {d, d"^ , . . . , d^) . 

Let Bq = B, Mq = M and 5o = 6. Proposition |4] ensures that either 

(3) Mn < Sn"" 

or else the iteration proceeds allowing us to choose Af„+i, (5„+i and Bn+i such that 

M„+i >c,5i2'=+i)Af„/log<5-i 

and 

Now as long as the iteration continues we must have (5„ < 1, and so after 0(6^^'' \ogS^^) iterations 
condition ([3]) must be satisfied, giving 

From this it follows that 

log A// < CS-'^''-^\logS-^f 
and consequently (after a short calculation that we leave to the reader) that 

(loglogA//)2Ni/("-^) 



logAf 

This establishes Theorem |31 □ 
The rest of this article is devoted to the proof of Proposition |4l 

3. Setting the stage for the proof of Proposition [4] 

We suppose that B C Qm, <5 — |-B|/|Qm|, and Af > 6^^ . Our approach will be to assume that B exhibits 
neither of the two properties described in Proposition |4] and then seek a contradiction. 
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3.1. A simple consequence of B being non-random. If we were to suppose that B is non-random, in 
the sense that inequahty ([1]) does not hold, then it would immediately follows that 

(4) J2 lBim)lBin)lsim-n)<^S\B\\S\ 
where 

5' = : 1 <d<eM}. 

3.2. A simple consequences of B being non-structured. If we were to assume that B is regular, in 
the sense that B in fact satisfies the inequality 

|BnA| < S{l + (j)\A\ 

for all arithmetic grids A C Qj^.j of the form ^ with L > S^^'^aM , then the set 

B' = Bn {{sM, (1 - s)M] X • • ■ X (eA./^ (1 - £)M'=]) 

must contain most of the elements of _B. In particular we must have 

(5) \B'\ > (3/4)151 

since if this were not the case we would immediately obtain a grid A C Q^/ of the form ([2]) with q = I and 
L > eM such that 

|BnA| >(5(1 + 1/4)|A|. 

3.3. The balance function. We define the balance function of B to be 

fs = ~ <51Qm7 

and note that /b has mean value zero, that is ^ = 0. This property of the balance function fs will 

be critically important in our later arguments. 

It easy to verify that if B satisfies inequalities ([!]) and ([5]), then 

(6) fB{m)fBin)lsim-n)<-^S\B\\S\. 

One can see this by simply expanding the sum into a sum of four sums, one involving only the function Ig 
on which we can apply two involving the functions 1b and —SIqj^j on which we can apply ([5]), and one 
involving only the function — (51q„ which can be estimated trivially. 

3.4. Fourier analysis on Z'^. For / : Z''' ^ C with finite support we define the Fourier transform of f to 
be 

f{a) = J2 /Me-'"™-"- 

The finite support assumption on / ensures that / is a continuous function on T*^ and that orthogonality 
immediately gives both the Fourier inversion formula and Plancherel's identity, namely 

/(m)=/ /(a)e2™da and / \fia)\'da=y^\f{m)\^ 

It is then easy to verify that from inequality ([6]) we immediately obtain the estimate 

(7) / \TB{a)\'\rsia)\da>\s\B\\S\ 

Jfk 4 

where we recognize 

eM 

(8) Ts{a) = e-2-»("i'i+"2d^+-+afc<i'=)^ 

d=l 

as a classical Weyl sum. 
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3.5. Estimates for Weyl sums. Since e — (lOfc)^^ is fixed it is clear that whenever \aj\ ^ M^^ there 
can be no cancellation in the Weyl sum ([5]), in fact the same is also true when each is close to a rational 
with small denominator (in other words there is no cancellation over sums in residue classes modulo q). 

We now state a precise formulation of the well-known fact that this is indeed the only obstruction to 
cancellation. Let 77 > 0. We define 



(9) = M,(?7) ^iaer : a, - ^ < -j— {1 < j < k) for some a e [1, q] 



1 



Lemma 5. Let 77 > and M > rj ^' (with C sufficiently large depending on k). 
(i) (Minor box estimate) If a ^ Mg for any 1 < q < rj~^ , then 

\ls{a)\<Cii\S\. 

•-q for 

\ls{a)\<Cq-'"'\S\. 



(ii) (Major box estimate) If a & for some I < q < rj , then 



The proof of this result is a straightforward (and presumably well-known) consequence of the standard 
estimates for Weyl sums, for the sake of completeness we include these arguments in Appendix A. 

4. The proof of Proposition [4] 

In the previous section we established that inequalities Q and (O would be immediate consequences of 
B not exhibiting either of the two properties described in Proposition [4] We now present the two lemmas 
from which we will obtain our desired contradiction. 

In both lemmas below we set rj — 6/8C, where C is the large constant in Lemma [S] 

Lemma 6. Let 77 — 6/8C. If B is neither random nor structured, in the sense outlined in Proposition^ 
then there exists 1 < q < rj^^ such that 

(10) -i- / \TB{a)\'da>c5^-'/\og5-\ 

The second lemma is a precise quantitative formulation, in our setting, of the standard density incre- 
ment lemma. 

Lemma 7. Let 77 = 5/8C and a < rf^'^/^TT. If B is regular, in the sense that 

|BnA| < 5{l + a)\K\ 
for all arithmetic grids A C Qj^j of the form 0) with qL > rf'aM , then 

(11) / |/^(a)pda<12a. 

We therefore obtain a contradiction if < c5^^^ / log (5^^, proving Proposition 21 
4.1. Proof of Lemma (6], It follows from the minor box estimate of LemmalHland Plancherel's identity that 

\TB{a)\''\Ts{a)\da<Ci^\S\\B\. 



minor boxes 



Therefore, if 77 = S/SC^ it follows from estimate (O and the major box estimate of Lemma that 

A simple counting argument (which we leave to the reader) then allows us to conclude that there exists a 
1 < q < 7]^'^ such that 

\fB{a)\''da> 



I 



log 77-'= 
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as required. □ 

4.2. Proof of Lemma [71 We fix q and L so that qL ~ rf'aM and define 

A = {-(4g,^2g',...,4(?')|l <i^}. 
Claim 1. IJae Mg, then |U(a)| > |A|/2. 

Proof of Claim [IJ Since 

for all a G Mg, where || • || denotes the distance to the nearest integer, it follows that 

LJ k 

|U(a)| > |A| - ^|e2^»(^i9"i+-+^.<z'=a.) „ i| > |A|(^l _27r^L^'||(7^aj-l|) > |A|/2, 

lj = l 3 = 1 

for aU a G Mg, provided a < rf^"^ /Sir. □ 

Plancherel's identity (applied to the function fs * 1a) and Claim [1] imply that 
(12) ^ < ^ E * IaMI^- 

The conclusion of Lemma [7] will therefore be an immediate consequence of the following. 
Claim 2. As a consequence of the assumptions in Lemma^ if follows that 

^ \fB*lA{m)\^ <3<yS\B\\A\\ 

Proof of Claim\M We let 

M = {m G Z'^' |m- A C Qm} 
£ = {Qm + A)\M 

and write 

E l/i3*lAMI'= E l/s*lA(m)|2+ ^ |/B*lA(m)|2. 

We note that since 

fB * lA(m) = |B n (m - A)| - S\Qm n (m - A)| 
it follows from our regularity assumption on B that if m G Al , then 

-6\A\ < fB * IaM < 6a\A\, 
while for m G f we can only conclude that 

|/s*1aMI < |A|. 
Now since /s has mean value zero the convolution 

fs * lA{m) = E fBin)lA{m - n) 

n 

also has mean value zero. Thus, using the fact that \g\ = 2g+ — g, where g+ = max{g,0} denotes the 
positive-part function, and the trivial size estimate \M\ < |Qm|, we can deduce that 

E \fB * lA(m)|2 < 2 sup \fB * IaMI * 1a)+(to) 

meM \meM J 

<2(b\A\){ba\A\)\M\ 
<25V|AnQM|. 
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We leave it to the reader to verify that 



£\ < ((1 + 277^)^ - (1 - 2ij^ar) \Qm\ < ^kifa\Q 



M 



and hence 



provided 8kri^ ^ S^. 

This concludes the proof of Claim [2] and establishes Lemma [71 



□ 



Appendix A. Weyl sum estimates 
A.l. Standard major and minor arc estimates. Let P{a, d) = aid + ■ • • + akd^ . 
Lemma 8 (Weyl inequality). If\ak — aukll\ < 9 ond {a.q) = 1, then 



N 



,2TriP{a,d) 



1/2*= 



This result is completely standard, see for example |9j. We now fix a sufficiently small fj, — ^{k) > and 
define 



(13) 



Ml/, = ^ a G T'^- 



< 



1 



(1 < J < k) 



Successive applications of Dirichlet's principle and the Weyl inequality, starting with the highest power k, 
gives the following qualitative estimate (a quantitative version of which can be found in Vinogradov [17j). 

Proposition 9 (Minor arc estimate I). If a ^ '^'a/q f'^''' '^^y ^ ^ 1 < <? < N^^, then 

N 
d=l 



(14) 



for some v — v(k^ /i) > 0. 

Proposition 10 (Major arc estimate). If a € '^'a/q some {a,q) = 1 with I <q < N^, then 



(15) 



N 

E' 

d=l 



,2-iTiP{a,d) 



— Ilk 

< CNq-^''' (l + E - + 0{N^'^). 



Proof. It is straightforward to write 

N 

E e^"'^^"''') = q-^S{a, q)vN{a ~ a/q) + 0{N^'^) 



where 



9-1 fN 

5(a,g) :-E^'"''^°'''^^' and vnW) := e^^'^^^'^Ux 



r=0 

The result then follows from the observation that 

(16) \S{a,q)\<Cq'~^/'' 

whenever (a, q) — 1, which is a result of Hua (see for example [16 ), and 



fe 



(17) 



which follows from van der Corput's lemma for oscillatory integrals (see for example 14 ) and rescaling. □ 
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A. 2. Refinement of the major arcs. Let < < 1 and 



(18) 



M 



a, 



< 



1 



(1 < i < k) 



Combining Propositions[9]and[Tn]we easily obtain the following result from which Lemma[5]is an immediate 
consequence. 

Proposition 11 (Minor arc estimate II). If a ^ M^/^ for any {a,q) ~ 1 with I < q < Tj^^ , then 



N 

^2iriP{a,d) 



< CriN + 0{N^-''). 



Proof. It follows from Proposition [TO] that on M^^^ we have 



N 



,2TriP{a,d) 



< Ct]N 



provided {a,q) = 1 and either 
or there exists j such that 



Tj-''N-^ < \aj - aj/q\ < N 



□ 



[6: 
[7: 

[9 

[lo- 
in 
[12: 

[13: 

[14 

[is; 
[16; 
[ir 
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